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Abstract 

Mechanical unfolding of the fourth domain of Distyostelium discoideum filamin (DDFLN4) was 
studied in detail using the Cq-Go model. We show that unfolding pathways of this protein depend 
on the pulling speed. The agreement between theoretical and experimental results on the sequenc- 
ing of unfolding events is achieved at low loading rates. The unfolding free energy landscape is 
also constructed using dependencies of unfolding forces on pulling speeds. 
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I. INTRODUCTION 



The last ten years have witnessed an intense activity in single-molecule force spectroscopy 
experiments in detecting inter and intramolecular forces of biological systems to understand 
their functions and structures. Much of the research has been focused on elastic properties 
of proteins, DNA, and RNA, i.e, their response tq_an external force, following the seminal 
papers by Rief et al. and Tskhovrebova et al. The main advantage of this technique 
is its ability to separate out the fluctuations of individual trajectories from the ensemble 
average behavior observed in traditional bulk biochemical experiments. This allows for 
studying unfolding pathways in detail using the end-to-end distance as a reaction coordinate. 
Moreover, the single-molecule force spectroscopy can be used to decipher the unfolding free 
energy landscape (FEL) of biomolecules 0, 4]. 

As cytoskeletal proteins, large actin-binding proteins play a key roles in cell organization, 
mechanics and signalling 5j . During the process of permanent cytoskeleton reorganization, 
all involved participants are subject to mechanical stress. One of them is the fourth domain 
Distyostelium discoideum filamin (DDFLN4), which binds different components of actin- 
binding protein. Therefore, understanding the mechanical response of this domain to a 
stretched force is of great interest. Recently, using the AFM experiments, Schwaiger et al. 

have obtained two major results for DDFLN4. First, this domain (Fig. [1]) unfolds 
via intermediates as the force-extension curve displays two peaks centered at the end-to-end 
extension AR ^12 nm and AR 22 nm. Second, with the help of loop mutations, it was 
suggested that during the first unfolding event (first peak) strands A and B unfold first. 
Therefore, strands C - G form a stable intermediate structure, which then unfolds in the 
second unfolding event (second peak). In addition, Schwaiger et al. j3] have also determined 
the FEL parameters of DDFLN4. 

With the help of the Cq-Go model 8], Li et al. {4] have demonstrated that the mechanical 
unfolding of DDFLN4 does follow the three-state scenario but the full agreement between 
theory and experiments was not obtained. The simulations ^ showed that two peaks in 
the force-extension profile occur at AR ^1.5 nm and 11 nm, i.e., the Go modeling does not 
detect the peak at AR ^ 22 nm. Instead, it predicts the existence of a peak not far from the 
native conformation. More iniportantly, theoretical unfolding pathways {4] are very different 
from the experimental ones 6|]: the unfolding initiates from the C-terminal, but not from 
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the N-terminal terminal as shown by the experiments. 

It should be noted that the pulling speed used in the previous simulations is about five 
orders of magnitude larger than the experimental value 6|. Therefore, a natural question 
emerges is if the discrepancy between theory and experiments is due to huge difference in 
nulling speeds. Motivated by this, we have carried low-t" simulations, using the Go model 
8|. Interestingly, we uncovered that unfolding pathways of DDFLN4 depend on the pulling 
speed and only at f ~ 10^ nm/s, the theoretical unfolding sequencing coincides with the 
experimental one Q|. However, even at low loading rates, the existence of the peak at 
AR ^ 1.5 nm remains robust and the Go modeling does not capture the maximum at 
AR ^ 22 nm. 

In the previous work j4|, using dependencies of unfolding times on external forces, the 
distance between the native state (NS) and intermediate state (IS) Xui, and the distance 
between the IS and denaturated state (DS) x,,o of DDFLN4 have been estimated. In the 

a 

Bell approximation, the agreement between the theory and experiments [7|] was reasonable. 
However, in the non-Bell approximation j^, the theoretical values of Xui, and Xu2 seem to 
be high ^. In addition the unfolding barrier between the first transition state (TSl) and 



NS AGl is clearly higher than its experimental counterpart (Tab 



9| holds for a three-state 



In this paper, assuming that the microscopic kinetic theory 
protein, we calculated Xui{i = 1,2) and unfolding barriers by a different method which is 
based on dependencies of peaks in the force-extension curve on v. Our present estimations 
for the unfolding FEL parameters are more reasonable compared to the previous ones Q]. 
Finally, we have also studied thermal unfolding pathways of DDFLN4 and shown that the 
mechanical unfolding pathways are different from the thermal ones. 



II. METHOD 



The native conformation of DDFLN4, which has seven /3-strands, enumerated as A to 
G, was taken from the PDB (PI: IKSR, Fig. [1^). We assume that residues i and j are 
in native contact if the distance between them in the native conformation, is shorter than 
a cutoff distance dc = 6.5 A. With this choice of dc, the molecule has 163 native contacts. 
Native contacts exist between seven pairs of /5-strands Pab; Paf, Pre, Pcd, Pcf, Pde, and 
Pfg (Fig. [lb). 
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We used the Cn,-Go model 



model are chosen as follows 



IC 
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for a molecule. The corresponding parameters of this 



1^: Kr = 100eH/A2, Kq = 20eH/rad2, K^^^ = en, and 



K 



(3) 



0.5eH, wher e en is the characteristic hydrogen bond energy, and C = 4 A. As in 



our previous works 
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111 ], we set en = 0.98 kcal/mol. Then, the temperature T = 285 K 



corresponds to O.bSen/kB and all computations have been performed at this temperature. 
The force unit is [/] = e^/A = 68 pN |lo|. 

The simulations were carried out in the over-damped limit with the water viscosity ( = 
50^ 12|, where the time unit = {ma? / enY^"^ ?5i 3 ps, m is a typical mass of amino-acids, 
and a = 4A a distance between two neighboring residues. Neglecting the inertia term, the 
Brownian dynamics equation was numerically solved by the simple Euler method. Due to 
the large viscosity, we can choose a large time step At = O.lr^, and this choice allows us to 
study unfolding at low loading rates. 

In the constant velocity force simulations, we fix the N-terminal and pull the C-terminal 
by applying the force f = Kr{vt — r), where r is the displacement of the pulled atom from 
its original position 13| , and the spring constant of cantilever Kr is set to be the same as the 
spring constant of the Go model. The pulling direction was chosen along the vector drawn 
from the fixed atom to the pulled one. 

The mechanical unfolding sequencing was studied by monitoring the fraction of native 
contacts of the /5-strands and of their seven pairs as a function of Ai?, which is admitted 
a good reaction coordinate. In order to probe thermal unfolding pathways, for the i-th 
trajectory we introduce the progress variable 5i = t/T^^, where is the unfolding time jlol |. 
Then one can average the fraction of native contacts over many trajectories in a unique 
time window < 5j < 1 and monitor the unfolding sequencing with the help of the progress 
variable 6. 



III. RESULTS 



A. Robustness of peak at AR pa 1.5 nm and absence of maximum at Ai? w 22 nm 
at low pulling speeds 

In our previous high pulling speed {v = 3.6 x 10'' nm/s) simulations Q], the force- 
extension curve shows two peaks at Ai? ^1.5 nm and 10 nm, while the experiments showed 
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that peaks appear at AR ?5i 12 nm and 22 nm. The question we ask if one can reproduce 

the experimental results at low pulling speeds. Within our computational facilities, we were 

able to perform simulations at the lowest v = 2.6 x 10"^ nm/s which is about three orders of 

fl 

magnitude lower than that used before |4|. 

Fig. [2] show force-extension curves for four representative pulling speeds. For the highest 
V = 7.2 X 10^ nm/s (Fig. [2^), there are two peaks located at extensions AR ^ 1.5 nm and 9 
nm. As evident from Figs. [2)d, c and d, the existence of the first peak remains robust against 
reduction of v. Positions of fmaxi weakly fluctuate over the range 0.9 ^ AR < 1.8 nm for 
all values of v (Fig. [3]). As f is reduced, fmaxi decreases but this peak does not vanish if one 
interpolate our results to the lowest pulling speed Vg^p = 200 nm/s used in the experiments 
[g] (see below). Thus, opposed to the experiments, the first peak occurs already at small 
end-to-end extensions. We do not exclude a possibility that such a peak was overlooked by 
experiments, as it happened with the titin domain 127. Recall that, for this domain the first 
AFM experiment l| did not trace the hump which was observed in the later simulations 

n n 

[13[ and experiments |14l |. 

Positions of the second peak fmax2 are more scattered compared to fmaxi, ranging from 
about 8 nm to 12 nm (Fig. [3]). Overall, they move toward higher values upon reduction 
of V (Fig. [2]). If at f = 6.4 x 10^ nm/s only about 15% trajectories display ARmax2 > 10 
nm, then this percentage reaches 65% and 97% for v = 5.8 x 10^ nm/s and 2.6 x 10^ nm/s, 
respectively (Fig. [3l). 

At low V, unfolding pathways show rich diversity. For v ^ 6.4 x 10^ nm/s, the force- 
extension profile shows only two peaks in all trajectories studied (Fig. [2^ and [2b), while for 
lower speeds v = 5.8 x 10^ nm/s and 2.6 x 10^ nm/s, about 4% trajectories display even 
four peaks (Fig. [2t and[2]i), i.e. the four-state behavior. 

We do not observe any peak at AR ^ 22 nm for all loading rates (Fig. [2]), and it is 
very unlikely that it will appear at lower values of v. Thus, the Go model, in which non- 
native interactions are neglected, fails to reproduce this experimental observation. Whether 
inclusion of non-native interactions would cure this problem requires further studies. 
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B. Dependence of mechanical unfolding pathways on loading rates 



The considerable fluctuations of peak positions and occurrence of even three peaks already 
suggest that unfolding pathways, which are kinetic in nature, may change if v is varied. To 
clarify this point in more detail, we show Ai?-dependencies of native contacts of all 13- 
strands and their pairs for v = 7.2 x 10^ nm/s (Fig. H]) and v = 2.6 x 10^ nm/s (Fig. [5]). 
For V = 7.2 X 10^ nm/s, one has the following unfolding pathways: 

G ^ F ^ {C,E,D) ^ B ^ A, (la) 

Paf Pre {Pfg, Pcf) Pcd Pde Pab- (lb) 

According to this scenario, the unfolding initiates from the C-terminal, while the experiments 
[gI showed that strands A and B unfold first. For v = 2.6 x 10^ nm/s. Fig. Ogives the 
following sequencing 

iA,B)^iC,D,E)^iF,G), (2a) 

Paf (Pbe, Pab) Pcf (Pcd, Pde, Peg)- (2b) 

We obtain the very interesting result that at this low loading rate, in agreement with the 
AFM experiments j^l, the N-terminal detaches from a protein first. 

For both values of v, the first peak corresponds to breaking of native contacts between 
strands A and F (Fig. IDd and Fig. \5jp). However, the structure of unfolding intermediates, 
which correspond to this peak, depends on v. For = 7.2 x 10^ nm/s (Fig. HI), at AR 1.5 
nm, native contacts between F and G are broken and strand G has already been unstructured 
(Fig. |4^). Therefore, for this pulling speed, the intermediate consists of six ordered strands 
A-F (see Fig. [6^ for a typical snapshot). In the v = 2.6 x 10^ nm/s case, just after 
the first peak, none of strands unfolds completely (Fig. [5k), although (A,F) and (B,E) 
contacts have been already broken (Fig. \5jp). Thus, the intermediate looks very different 
from the high v case, as it has all secondary structures partially structured (see (Fig. [6b) 
for a typical snapshot). Since the experiments showed that intermediate structures 
contain five ordered strands C-G, intermediates predicted by simulations are more ordered 
than the experimental ones. Even though, our low loading rate Go simulations provide the 
same pathways as on the experiments. The difference between theory and experiments in 
intermediate structures comes from different locations of the first peak. It remains unclear if 
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this is a shortcomiiiEf of Go models or of the experiments because it is hard to imagine that 

n 

a /5-protein hke DDFLN4 displays the first peak at such a large extension AR ^ 12 nm [6|. 
The force-extension curve of the titin domain 127, which has a similar native topology, for 
example, displays the first peak at AR ^ 0.8 nm [l^. From this prospect, the theoretical 
result is more favorable. 

The strong dependence of unfolding pathways on loading rates is also clearly seen from 
structures around the second peak. In the v = 7.2 x 10^ nm/s case, at AR ^ 11 nm, strands 
A and B remain structured, while other strands detach from a protein core (Fig. |4] and Fig. 
Eb). This is entirely different from the low loading case, where A and B completely unfold 
but F and G still survive (Fig. Oand Fig. [^). The result, obtained for v = 2.6 x 10"^ nm/s, 
is in full agreement with the experiments 6j that at AR 12 nm, A and B detached from 
the core. 

Note that the unfolding pathways given by Eq. [Tal [Tb| [2a] and [2b] are valid in the 
statistical sense. In all 50 trajectories studied for v = 7.2 x 10^ nm/s, strands A and B 
always unfold last, and F and G unfold first (Eq. [Tal) . while the sequencing of unfolding 
events for C, D and E depends on individual trajectories. At f = 2.6 x 10^ nm/s, most 
of trajectories follow the pathway given by Eq. [2al but we have observed a few unusual 
pathways, as it is illustrated in Fig. [71 Having three peaks in the force-extension profile, 
the evolution of native contacts of F and G display an atypical behavior. At AR 7 nm, 
these strands fully unfold (Fig. [Tj^), but they refold again at AR ^11 nm (Fig. [7b and[7|i). 
Their final unfolding takes place around AR ^ 16.5 nm. As follows from Fig. [7b, the first 
peak in Fig. [7^ corresponds to unfolding of G. Strands A and B unfold after passing the 
second peak, while the third maximum occurs due to unfolding of C-G , i.e. of a core part 
shown in Fig. [Tli. 

The dependence of unfolding pathways on v is understandable. If a protein is pulled very 
fast, the perturbation, caused by the external force, does not have enough time to propagate 
to the fixed N-terminal before the C-terminal unfolds. Therefore, at very high v, we have 
the pathway given by Eq. [Tal In the opposite limit, it does matter what end is pulled as 
the external force is uniformly felt along a chain. Then, a strand, which has a weaker link 
with the core, would unfold first. 
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C. Computation of FEL parameters 



As mentioned above, at low loading rates, for some trajectories, the force-extension curve 
does not show two, but three peaks. However, the percentage of such trajectories is rather 
small, we will neglect them and consider DDFLN4 as a three-state protein. Recently, using 
dependencies of unfolding times on the constant external force and the non-linear kinetic 



theory 



3], we obtained distances x„i ~ Xu2 ~ IsA ^. These values seem to be large for (3- 



proteins like DDFLN4, which are supposed to have smaller Xu compared to a//3- and a-ones 



Iq . A clear difference between theory and experiments was also observed for the unfolding 



barrier AG\. In order to see if one can improve our previous results, we will extract the 
FEL parameters by a different approach. Namely, assuming that all FEL parameters of 
the three-state DDFLN4, including the barrier between the second transition state and the 
intermediate state AG\ (see Ref. 4 for the definition), can be determined from dependencies 
of fmaxi and fmax2 ou V, wc Calculate them in the the Bell-Evans-Rirchie approximation as 
well as beyond this approximation. 

Estimation of Xui and x„2 in the Bell-Evans-Rirchie approximation 
In this ap proximation, Xui and Xu2 are related to v, fmaxi and fmax2 by the following 
equation 



16[: 



fr, 



In 



X7, 



vx,. 



1,2, 



(3) 



where kuii'^) is unfolding rates at zero external force. In the low force regime (t; ^ 2 x 10^ 
nm/s), the dependence of fmax on v is logarithmic and Xui and Xu2 are defined by slopes of 
linear fits in Fig. [Hi Their values are listed in Table 1. The estimate of Xu2 agrees very well 
with the experimental [3] as well as with the previous theoretical result {4]. The present 
value of Xui agrees with the experiments better than the old one {4]. Presumably, this is 
because it has been estimated by the same procedure as in the experiments [3]. 

It is important to note that the logarithmic behavior is observed only at low enough 
V. At high loading rates, the dependence of fmax on v becomes power-law. This explains 
why all-atom simulations, performed at w ~ 10^ nm/s for most of proteins, are not able to 
provide reasonable estimations for Xu- 

The another interesting question is if the peak at Ai? ^ 1.5 nm disappears at loading 
rates used in the experiments [3]. Assuming that the logarithmic dependence in Fig. [8] 
has the same slope at low we interpolate our results to Vexp = 200 nm/s and obtain 
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fmaxiivexp) ~ 40 pN. Thus, in the framework of the Go model, the existence of the first 
peak is robust at experimental speeds. 

Beyond the Bell-Evans-Rirchie approximation 

In the Bell-Evans-Rirchie approximation, one assumes that the location of the transition 
state does not move under the action of an external force. However, our simulations for 
ubi,u,t.. fo. example, showed that it does .ove towarf the NS Q. Rece.t., assuming 
that Xu depends on the external force and using the Kramers theory, Dudko et al. have 
tried to go beyond the Bell-Evans-Rirchie approximation. They proposed jsl the following 
formula for dependence of the unfolding force on Xu and v. 



VXu 



AG* XuV 



(4) 



Here, AG"'^ is the unfolding barrier, u = 1/2 and 2/3 for the cusp [17| and the linear-cubic 
free energy surface jl8], respectively. 7 ~ 0.577 is the Euler-Mascheroni constant. Note 
that u = 1 corresponds to the phenomenological Bell theory (Eq. [3]). If z/ 7^ 1, then Eq. H] 
can be used to estimate not only Xu, but also G^. Since the fitting with u = 1/2 is valid 
in a wider force interval compared to the u = 2/3 case, we consider the former case only. 
The region, where the u = 1/2 fit works well, is expectantly wider than that for the Bell 
scenario (Fig. [8l). From the nonlinear fitting (Eq. Hj), we obtain Xui = 7. OA , and Xu2 = 9.7A 
which are about twice as large as the Bell estimates (Table 1). Using AFM data, Schlierf 
and Rief [l^, have shown that beyond Bell-Evans-Rirchie approximation Xu ~ llA. This 
value is close to our estimate for Xu2- However, a full comparison with experiments is not 
possible as these authors did not consider Xui and Xu2 separately. The present estimations 
of these quantities are clearly lower than the previous one ^ (Table 1). The lower values of 
X11, would be more favorable because they are expected to be not high for beta-rich proteins 



ISj like DDFLN4. Thus, beyond Bell-Evans-Rirchie approximation, the method based on 



Eq. H] provides more reasonable estimations for x^i compared to the method, where these 
parameters are extracted from unfolding rates However, in order to decide what method 
is better, more experimental studies are required. 

The corresponding values for G\, and G| are listed in Table 1. The experimental and 
previous theoretical results 4] are also shown for comparison. The present estimates for 
both barriers agree with the experimental data, while the previous theoretical value of AGf 
fits to experiments worse than the current one. 
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D. Thermal unfolding pathways 



In order to see if the thermal unfolding pathways are different from the mechanical ones, 
we performed zero-force simulations at T = 410 K. The progress variable 6 is used as a 
reaction coordinate to monitor pathways (see Materials and Methods). From Fig. [HI we 
have the following sequencing for strands and their pairs: 

G^{B,C,E)^{A,F,D), (5a) 

Paf Pbe ^ {Pcd, Pcf) ^ {PAB, Ppc, Pde)- (5b) 

It should be noted that these pathways are just major ones as other pathways are also 
possible. The pathway given by Eq. [50, e.g., occurs in 35% of events. About 20% of 
trajectories follow Paf ^ Pcf ^ Pbe ^ {Pcd, PAB, Ppc, Pde) scenario. We have also 
observed the sequencing Paf — ^ Pbe {Pcf, PAB, Pfg, Pde) Pen, and Pbe Paf 
{PcD, PCF, Pab, Peg, Pde) in 12% and 10% of runs, respectively. Thus, due to strong 
thermal fluctuations, thermal unfolding pathways are more diverse compared to mechanical 
ones. From Eqs. [Tal [Tb| [2al [2b| [Sal and [5bl it is clear that thermal unfolding pathways of 
DDFLN4 are different from the mechanical pathways. This is also illustrated in Fig. [9t. 
As in the mechanical case (Fig. [6^ and [6b), the contact between A and F is broken, but 
the molecule is much less compact at the same end-to-end distance. Although 7 contacts 
(~ 64%) between strands F and G remain survive, all contacts of pairs Paf, Pbe and Pcd 
are already broken. 

The difference between mechanical and thermal unfolding pathways is attributed to the 
fact that thermal fluctuations have a global effect on the biomolecule, while the force acts 



only on its termini. Such a difference was also observed for other proteins like 127 20|] and 



ubiquitin [lO, [2l|]. We have also studied folding pathways of DDFLN4 at T = 285 K. It 
turns out that they are reverse of the thermal unfolding pathways given by Eqs. [5aland[5bl 
It would be interesting to test our prediction on thermal folding/unfolding of this domain 
experimentally. 
Conclusions 

The key result of this paper is that mechnanical unfolding pathways of DDFLN4 depend 
on loading rates. At large v the C-terminal unfolds flrst, but the N-terminal unfolds at low 
V ~ 10^ nm/s. The agreement with the experiments is obtained only in low loading rate 
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simulations. The dependence of mechanical unfolding pathways on the loading rates was also 



Id, 



22| 



observed for 127 (M.S. Li, unpublished). On the other hand, the previous studies 
showed that mechanical unfolding pathways of the two-state ubiquitin do not depend on 
the force strength. Since DDFLN4 and 127 are three-state proteins, one may think that the 
unfolding pathway change with variation of the pulling speed, is universal for proteins that 
unfold via intermediates. A more comprehensive study is needed to verify this interesting 
issue. 

Dependencies of unfolding forces on pulling speeds have been widely used to probe FEL of 



two-state proteins 



23[. However, to our best knowledge, here we have made a first attempt 



to apply this approach to extract not only Xui, but also AG^ = 1, and 2) for a three-state 
protein. This allows us to improve our previous results jj]. More importantly, a better 
agreement with the experimental data 0, [isl suggests that this method is also applicable to 
other multi-state biomolecules. Our study clearly shows that the low loading rate regime, 
where FEL parameters can be estimated, occurs at f < 10^ nm/s which are about two-three 
orders of magnitude lower than those used in all-atom simulations. Therefore, at present, 
deciphering unfolding FEL of long proteins by all-atom simulations with explicit water is 
computationally prohibited. From this point of view, coarse-grained models are of great 
help. 

We predict the existence of a peak at AR ~ 1.5 nm even at pulling speeds used in 
now a day experimental setups. One of possible reasons of why the experiments did not 
detect this maximum is realted to a strong linker effect as a single DDFLN4 domain is 
sandwiched between Ig domains 127-30 and domains 131-34 from titin j^. Therefore, our 
result would stimulate new experiments on mechanical properties of this protein. Capturing 
the experimentally observed peak at AR ~ 22 nm remains a challenge to theory. 



2l\ l are different 



Mechanical unfolding pathways of DDFLN4 and other proteins llQ . 
from thermal ones. In accord with a common belief ^£4}, thermal unfolding pathways of these 
proteins were shown to be reverse of folding pathways. Therefore, their folding mechanisms 
can not be gained from mechanical studies. Recently, using the all-atom simulations with 



implicit solvent 



'2a l. it has been found that a 49-residue C-terminal of T0P7 (residues 2- 



50 of 2GJH.pdb) folds via a non-trivial caching mechanism \2dJ and its thermal unfolding 



pathways are not reverse of the folding ones 



'2' 



/. Can the folding mechanism of this fragment 



be deduced from mechanical unfolding simulations and experiments ? A detailed study of this 
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interesting question is in progress but our preliminary simulation results show that folding 
pathways may be inferred from the mechanical ones. 

The work was supported by the Ministry of Science and Informatics in Poland (grant No 
202-204-234). MK is very grateful to the Polish committee for UNESCO for the financial 
support. 
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Bell approximation 


Beyond Bell approximation 








Xui{A) 


Xu2{A) 


AGl/keT AGl/keT 


Theory [4] 


6.3 ± 0.2 


5.1 ± 0.2 


13.1 


12.6 


25.8 18.7 


Theory (this work) 


3.2 ± 0.2 


5.5 ± 0.2 


7.0 


9.7 


19.9 20.9 


Exp. [7, 19] 


4.0 ±0.4 


5.3 ± 0.4 






17.4 17.2 



Table 1. Parameters Xui, and Xu2 were obtained in the Bell and beyond-Bell approxi- 
mation. Theoretical values of the unfolding barriers were extracted from the microscopic 
theory of Dudko et al (Eq. H]) with v = 1/2. The experimental estimates were taken from 



Ref. 



4. 
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Figure Captions 



FIGURE 1. (a) Native state conformation of DDFLN4 taken from the PDB (PDB ID: 
Iksr). There are seven /5-strands: A (6-9), B (22-28), C (43-48), D (57-59), E (64-69), F 
(75-83), and G (94-97). In the native state there are 15, 39, 23, 10, 27, 49, and 20 native 
contacts formed by strands A, B, C, D, E, F, and G with the rest of the protein, respectively. 
The end-to-end distance in the native state R^s = 40.2 A. (b) There are 7 pairs of strands, 
which have the nonzero number of mutual native contacts in the native state. These pairs 
are Fab, Paf, Pre, Pcd, Pcf, Pde, and Pfg- The number of native contacts between them 
are 11, 1, 13, 2, 16, 8, and 11, respectively. 

FIGURE 2. Typical force-extension curves for v = 7.2 x 10^ nm/s (a), 6.4 x 10^ nm/s 
(b), 5.8 X 10"^ nm/s (c), and 2.6x10^ mn/s (d). The arrow in (c) and (d) roughly refers to 
locations of additional peaks for two trajectories (red and green). 

FIGURE 3. Distributions of positions of fmaxi and fmax2 for v = 7.2x 10^ (sohd), 6.4 x 10^ 
(dashed) , 5.8 x 10"^ (dotted) and 2.6x10"^ mn/s (dashed-dotted). 

FIGURE 4. (a) Dependence of averaged fractions of native contacts formed by seven 
strands on AR for v = 7.2 x 10^ nm/s. (b) The same as in (a) but for pairs of strands. 
Arrows refer to the positions of peaks. Results were averaged over 50 trajectories. 

FIGURE 5. The same as in Fig. |4]but for v = 2.6 x 10^ nm/s. Results were averaged over 
50 trajectories. 

FIGURE 6. (a) Typical snapshot obtained at AR = 2 nm and v = 7.2 x 10^ nm/s. A 
single contact between strand A (blue spheres) and strand F (orange) was broken (dotted 
lines). Native contacts between F and G (red) are also broken and G completely unfolds. 

(b) The same as in (a) but for v = 2.6 x 10^ nm/s. Native contacts between A and F and 
between B and E are broken (dotted lines), but all strands are remain partially structured. 

(c) Typical snapshot obtained at AR = 11 nm and v = 7.2 x 10^ nm/s. Native contacts 
between pairs are broken except those between strands A and B. All 11 unbroken contacts 
are marked by solid lines. Strands A and B do not unfold yet. (d) The same as in (c) but 
for V = 2.6 X 10"^ nm/s. Two from 11 native contacts between F and G are broken (dotted 
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lines). Contacts between other pairs are already broken, but F and G remain structured. 

FIGURE 7. (a) Force-extension curve for an anomalous unfolding pathway at f = 2.6 x 10^ 
nm/s. (b) Dependence of fractions of native contacts of seven strands on AR. Snapshot at 
AR = 7.4 nm (c) and AR = 11 nm (d). 

FIGURE 8. Dependence of fmaxi (open circles) and fmax2 (open squares) on v. The 
values of these peaks were obtained as averages over all trajectories. The arrow separates 
the low pulling speed regime from the high one. Straight lines are fits to the Bell-Evans- 
Rirchie equation {y = —20.33 + 11. 424/n(x) and y = 11.54 + 6.528/?i(x) for Fmaxi and -Fmax2, 
respectively). Here fmax and v are measured in pN and nm/s, respectively. From these fits 
we obtain Xui = 3.2A and Xu2 = 5.5A. The solid circle and triangle correspond to fmaxi ~ 40 
pN and fmax2 ~ 46 pN, obtained by interpolation of linear fits to the experimental value 
V = 200 nm/s. Fitting to the nonlinear microscopic theory (dashed lines) gives Xui = 
7.0A,AGI = 19.9kBT,Xu2 = 9.7A, and AG| = 20.9kBT. 

FIGURE 9. Thermal unfolding pathways, (a) Dependence of native contact fractions of 
seven strands on the progress variable 6 aX T = 410 K. (b) The same as in (a) but for seven 
strand pairs, (c) A typical snapshot at AR ^1.8 nm. The contact between strands A and F 
is broken (dotted lines) but 7 contacts between strands S6 and S7 (solid lines) still survive. 
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